rank | frequency | n-gram |
---|---|---|
1 | 10948 | -s |
2 | 6687 | -e |
3 | 5201 | -d |
4 | 4037 | -y |
5 | 3715 | -g |
rank | frequency | n-gram |
---|---|---|
1 | 3871 | -ed |
2 | 3565 | -ng |
3 | 2394 | -es |
4 | 1723 | -on |
5 | 1678 | -er |
rank | frequency | n-gram |
---|---|---|
1 | 3489 | -ing |
2 | 1218 | -ion |
3 | 956 | -ted |
4 | 823 | -ers |
5 | 629 | -ent |
rank | frequency | n-gram |
---|---|---|
1 | 1009 | -tion |
2 | 768 | -ting |
3 | 464 | -ions |
4 | 373 | -ated |
5 | 356 | -able |
rank | frequency | n-gram |
---|---|---|
1 | 668 | -ation |
2 | 366 | -tions |
3 | 231 | -ating |
4 | 173 | -mmary |
5 | 144 | -ional |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings